MOMENTS OF ZETA AND CORRELATIONS OF DIVISOR-SUMS: I


BRIAN CONREY AND JONATHAN P. KEATING 


Abstract. We examine the calculation of the second and fourth moments and shifted 
moments of the Riemann zeta-function on the critical line using long Dirichlet polynomials 
and divisor correlations. Previously this approach has proved unsuccessful in computing 
moments beyond the eighth, even heuristically. A careful analysis of the second and fourth 
moments illustrates the nature of the problem and enables us to identify the terms that are 
missed in the standard application of these methods. 


1. Introduction 

We revisit the long-standing problem of determining the moments of the Riemann
zeta-function on the critical line from a somewhat new perspective. On one hand, we have the
detailed conjectures of [KS, CFKRS] which predict very precisely the outcome of any moment
calculation. On the other hand, we have the conventional approaches of analytic number
theorists via Dirichlet polynomial approximations to $\zeta(s)$ or $\zeta(s)^k$ which proceed by
refinements of the approximate Parseval formula for Dirichlet polynomials (i.e. the
Montgomery-Vaughan formula). We would like to bring these approaches closer together. In
particular, the Dirichlet polynomial approach completely fails when considering the 10th or
higher moment of $\zeta(s)$ on the critical line: it predicts negative values when the moments
are clearly non-negative. We want to understand and rectify, at least from a heuristic
perspective, this failing.

Our aim here is to illustrate the nature and cause of this failing by a careful analysis of 
the second and fourth moments. We take the standard approach of approximating the zeta 
function by long Dirichlet polynomials and computing moments using divisor correlations, 
but depart in using longer polynomials than is normal. 

The approximate functional equation expresses the zeta function at a height $T$ on the
critical line by Dirichlet polynomials of length of the order of $\sqrt{T}$. This allows all
principal terms (called, in [CFKRS], the moment polynomials) in the second [I, A] and fourth [HB]
moments to be determined. This approach extends heuristically to the sixth moment, and
to calculating the leading-order term in the asymptotics of the eighth moment; see, for example,
[CG]. For moments beyond the eighth, it leads to answers that are clearly incorrect, in
that they are negative. In order to understand this problem better, we here examine the
calculations for the second and fourth moments when we use longer Dirichlet polynomials;
specifically, we take the length to be substantially larger than $T$ (and so do not utilize the
approximate functional equation). We find that in this case the success of the standard
methods when applied to the second moment rests on a particular identity, and that they
fail when applied to the fourth moment by missing certain key terms. We believe that this
sheds new light on the moment problem.


2. Second moment 


We begin with the simplest case, that of the second moment of $\zeta(s)$; even in this case our
perspective reveals some new information. Our starting point is the theorem of Ingham [I]
that

$$\int_0^T \zeta(s+\alpha)\zeta(1-s+\beta)\,dt = \int_0^T \left(\zeta(1+\alpha+\beta) + \left(\frac{t}{2\pi}\right)^{-\alpha-\beta}\zeta(1-\alpha-\beta)\right) dt + \text{(error term)}$$

uniformly for small $\alpha$ and $\beta$, where $s = 1/2 + it$. Sandro Bettin [B] has proven that,
remarkably, the same formula holds for a much larger range of $\alpha$ and $\beta$, namely
$|\Re\alpha|, |\Re\beta| < 1/4$ and $|\Im\alpha|, |\Im\beta| \ll T$.

We compare this result with what one gets when using the work of Goldston-Gonek [GG]
on the mean-square of long Dirichlet polynomials. We refer to Corollary 1 of [GG] together
with example 1, which is about the case $\alpha = \beta = 0$ of Ingham’s theorem. In particular we
approximate $\zeta(s+\alpha)$ by a Dirichlet polynomial $\sum_{n \le X} n^{-s-\alpha}$ with $X$
substantially larger than $T$, say $X = T^{\theta}$ with $1 < \theta < 2$; the error in such an
approximation is $\ll X^{1/2}/T$ when $t \asymp T$, so that $\zeta(s)$ is well-approximated
point-wise by this long Dirichlet polynomial.
We introduce a smooth weight function $\psi$ which is real and compactly supported, say on
$[1,2]$, and then evaluate the mean square of $\zeta$ by evaluating (as [GG] do)

$$\int_0^\infty \psi(t/T)\,|\zeta(1/2+it)|^2\,dt = \int_0^\infty \psi(t/T)\,\Big|\sum_{n \le X}\frac{1}{n^{1/2+it}}\Big|^2 dt + O(X^{1/2}\log T).$$


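Using coefficient correlations, Goldston and Gonek prove that this is

$$= \hat\psi(0)\,T\sum_{n \le X}\frac{1}{n} + 2\Re\,T\int_y^\infty \left[\frac{v}{y}\right]\hat\psi(v)\,\frac{dv}{v} + O\left(T^{\epsilon}\left(X^{1/2} + X/T\right)\right), \qquad y = \frac{T}{2\pi X},$$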


where $[t]$ is the floor of $t$, i.e. the greatest integer less than or equal to $t$, and the
Fourier transform is defined by

$$\hat\psi(v) = \int \psi(u)\,e(uv)\,du$$

where $e(x) = \exp(2\pi i x)$.

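On the other hand, the mean square of $\zeta$ is [A]

$$\int_0^\infty \psi(t/T)\,|\zeta(1/2+it)|^2\,dt = \int_0^\infty \psi(t/T)\left(\log\frac{t}{2\pi} + 2\gamma\right) dt + O(X^{1/2}).$$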


Using the fact that

$$\sum_{n \le X}\frac{1}{n} = \log X + \gamma + O(1/X)$$
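As a quick numerical sanity check of this estimate, here is a minimal sketch. The function names `harmonic` and `remainder` and the hard-coded value of Euler's constant are choices made for this illustration, not anything taken from the paper.

```python
import math

# Euler's constant gamma, hard-coded to double precision.
EULER_GAMMA = 0.5772156649015329


def harmonic(X: int) -> float:
    """Return the partial sum sum_{n <= X} 1/n."""
    return math.fsum(1.0 / n for n in range(1, X + 1))


def remainder(X: int) -> float:
    """Return sum_{n <= X} 1/n - (log X + gamma), which should be O(1/X)."""
    return harmonic(X) - (math.log(X) + EULER_GAMMA)


if __name__ == "__main__":
    for X in (10, 100, 1000, 10000):
        # The scaled remainder X * remainder(X) stays bounded as X grows.
        print(X, remainder(X), X * remainder(X))
```

The scaled remainder staying bounded is exactly the $O(1/X)$ statement used in the next step.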


it follows that

$$2\Re\int_y^\infty \left[\frac{v}{y}\right]\hat\psi(v)\,\frac{dv}{v} = \int_0^\infty (\log uy + \gamma)\,\psi(u)\,du + o(1) \tag{1}$$

as $y \to 0^+$. In the present formulation, this is purely a statement about a relationship
between a function and its Fourier transform for real, compactly supported functions $\psi$.

We note that establishing the compatibility between the result of [GG] and that in [I, A] in
the approach taken here relies on proving equation (1). This we do below, as a consequence
of a more general identity.


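2.1. General $\alpha$ and $\beta$. If we do this with small shifts $\alpha$ and $\beta$ we have,
again with $s = 1/2 + it$,

$$\int_0^\infty \psi(t/T)\,\zeta(s+\alpha)\zeta(1-s+\beta)\,dt = T\int_0^\infty \psi(t)\left(\zeta(1+\alpha+\beta) + \left(\frac{tT}{2\pi}\right)^{-\alpha-\beta}\zeta(1-\alpha-\beta)\right) dt + \text{(error term)}$$

by Ingham. By [GG] this is also equal to

$$\hat\psi(0)\,T\sum_{n \le X}\frac{1}{n^{1+\alpha+\beta}} + 2T\int_y^\infty \sum_{h \le v/y}\left(\frac{hT}{2\pi v}\right)^{-\alpha-\beta}\Re\hat\psi(v)\,\frac{dv}{v} + O\left(T^{\epsilon}\left(X/T + X^{1/2}\right)\right)$$

where $y = \frac{T}{2\pi X}$.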


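Since

$$\sum_{n \le U}\frac{1}{n^{1+\alpha+\beta}} = \zeta(1+\alpha+\beta) - \frac{U^{-\alpha-\beta}}{\alpha+\beta} + O(1/U)$$

for $\Re(\alpha+\beta) > 0$ and $U > T/\pi$, it follows that

$$\int_0^\infty \psi(t)\left(\left(\frac{tT}{2\pi}\right)^{-\alpha-\beta}\zeta(1-\alpha-\beta) + \frac{X^{-\alpha-\beta}}{\alpha+\beta}\right) dt = 2\int_y^\infty \sum_{h \le v/y}\left(\frac{hT}{2\pi v}\right)^{-\alpha-\beta}\Re\hat\psi(v)\,\frac{dv}{v} + o(1)$$

as $y \to 0^+$. In fact, this is an identity. Note that (1) is the limiting case when
$\alpha, \beta \to 0$ of this identity.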
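To see how the limiting case arises, here is a short worked expansion. It is a sketch under the assumption that the $\psi(t)$-side of the identity has integrand $(tT/2\pi)^{-\delta}\zeta(1-\delta) + X^{-\delta}/\delta$ with $\delta = \alpha+\beta$ and $y = T/(2\pi X)$; it uses only the Laurent expansion of $\zeta$ at 1.

```latex
\begin{align*}
\zeta(1-\delta) &= -\frac{1}{\delta} + \gamma + O(\delta), \\
\left(\frac{tT}{2\pi}\right)^{-\delta} &= 1 - \delta\log\frac{tT}{2\pi} + O(\delta^2), \qquad
\frac{X^{-\delta}}{\delta} = \frac{1}{\delta} - \log X + O(\delta), \\
\left(\frac{tT}{2\pi}\right)^{-\delta}\zeta(1-\delta) + \frac{X^{-\delta}}{\delta}
  &= \log\frac{tT}{2\pi X} + \gamma + O(\delta) = \log(ty) + \gamma + O(\delta).
\end{align*}
```

The poles at $\delta = 0$ cancel, and the surviving constant term is the integrand $\log(ty) + \gamma$ of the unshifted case.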

When we pass to higher moments the corresponding identities will be consequences of 
conjectures rather than facts. Thus, it is desirable to give a direct proof of this now, one 
that can be imitated later. 


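Proof. Write $\delta = \alpha+\beta$. We can extend the integration in $v$ down to $v = 0$ since
the integrand is 0 for $0 < v < y$. We will show that

$$\int_0^\infty \psi(t)\left(t^{-\delta}\zeta(1-\delta) + \frac{y^{\delta}}{\delta}\right) dt = 2\int_0^\infty \sum_{h \le v/y}\left(\frac{h}{v}\right)^{-\delta}\Re\hat\psi(v)\,\frac{dv}{v}. \tag{2}$$

To begin with use

$$\sum_{h \le v/y} h^{-\delta} = \frac{1}{2\pi i}\int_{(2)} \zeta(s+\delta)\left(\frac{v}{y}\right)^{s}\frac{ds}{s}.$$

Then write

$$2\Re\hat\psi(v) = \int \psi(u)\left(e(-uv) + e(uv)\right) du.$$

Because of the support of $\psi$ the $u$-integral only runs over a finite segment of the positive
reals. We now want to interchange the $u$-integration and the $v$-integration, but we have to
be careful here. Basically we first split the $u$-integrand into two pieces, one with $e(uv)$ and
one with $e(-uv)$, and we deform the paths of the two ensuing $v$-integrals by rotating the
straight line paths along the positive reals onto the positive imaginary axis (in the case of
the $e(uv)$ integral) and the negative imaginary axis (in the case of the $e(-uv)$ integral). In
this way we get that the right hand side of (2) is

$$\frac{1}{2\pi i}\int_0^\infty \psi(u)\int_{(2)} \zeta(s+\delta)\,\frac{y^{-s}}{s}\int_0^\infty \left((iv)^{s+\delta} + (-iv)^{s+\delta}\right) e^{-2\pi v u}\,\frac{dv}{v}\,ds\,du.$$

The sum of the two $v$-integrals is

$$2\cos\left(\frac{\pi(s+\delta)}{2}\right)(2\pi)^{-s-\delta}\,\Gamma(s+\delta)\,u^{-s-\delta},$$

which is $\chi(1-s-\delta)\,u^{-s-\delta}$ where $\chi$ is the factor from the functional equation
$\zeta(1-s) = \chi(1-s)\zeta(s)$. Thus, our expression simplifies to

$$\frac{1}{2\pi i}\int_0^\infty \psi(u)\int_{(2)} \zeta(1-s-\delta)\,u^{-s-\delta}\,\frac{y^{-s}}{s}\,ds\,du. \tag{3}$$

Now we move the $s$-path of integration to the left to $\Re s = -\infty$. When we account for the
residues from the poles at $s = 0$ and $s = -\delta$ we exactly obtain the left-hand side of (2). □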


Note that we have shown that if $\Re s > 0$ then

$$2\int_0^\infty \Re\hat\psi(v)\,v^{s}\,\frac{dv}{v} = \chi(1-s)\int_0^\infty \psi(t)\,t^{-s}\,dt \tag{4}$$

for functions $\psi$ compactly supported on $(0,\infty)$.

We conclude that one can calculate the second moment of the zeta function with a power
savings on the error term using long Dirichlet polynomials (i.e. longer than in the
approximate functional equation), but this relies on establishing certain identities, such as that
proved above. We shall see that this becomes more challenging for the higher moments.

3. Fourth moment 

The fourth moment of the Riemann zeta-function has been much studied with the explicit 
formula of Motohashi [M] one of the crowning achievements. 

The main goal of this paper is to revisit this topic but through the lens described in the
last section. We want to understand how one can approach the fourth moment via the mean
square of a long Dirichlet polynomial approximation to $\zeta(s)^2$. This point of view has also
been explored to a certain extent in the work [HB] of Heath-Brown, which built upon that
of Ingham and Atkinson [I, A]; see also [CG]. But each of the above used an approximate
functional equation to reduce the necessary length of the approximating polynomial. Here
we want to see what happens purely with Dirichlet polynomials. This perspective will turn
out to be useful in a future discussion of yet higher moments and divisor correlations.

4. The 4th moment of zeta 

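To begin with we recall the fourth moment of zeta. Let $s = 1/2 + it$ and suppose that
$\alpha, \beta, \gamma, \delta \ll (\log T)^{-1}$. Then

$$\int_0^\infty \psi(t/T)\,\zeta(s+\alpha)\zeta(s+\beta)\zeta(1-s+\gamma)\zeta(1-s+\delta)\,dt = T\int_0^\infty \psi(t)\,Z_{tT}(\alpha,\beta,\gamma,\delta)\,dt + O(T^{2/3+\epsilon}), \tag{5}$$

where

$$\begin{aligned}
Z_t(\alpha,\beta,\gamma,\delta) = {} & Z(\alpha,\beta,\gamma,\delta)
 + \left(\frac{t}{2\pi}\right)^{-\alpha-\gamma} Z(-\gamma,\beta,-\alpha,\delta)
 + \left(\frac{t}{2\pi}\right)^{-\alpha-\delta} Z(-\delta,\beta,\gamma,-\alpha) \\
 & + \left(\frac{t}{2\pi}\right)^{-\beta-\gamma} Z(\alpha,-\gamma,-\beta,\delta)
 + \left(\frac{t}{2\pi}\right)^{-\beta-\delta} Z(\alpha,-\delta,\gamma,-\beta) \\
 & + \left(\frac{t}{2\pi}\right)^{-\alpha-\beta-\gamma-\delta} Z(-\gamma,-\delta,-\alpha,-\beta)
\end{aligned} \tag{6}$$

where

$$Z(\alpha,\beta,\gamma,\delta) = \frac{\zeta(1+\alpha+\gamma)\zeta(1+\alpha+\delta)\zeta(1+\beta+\gamma)\zeta(1+\beta+\delta)}{\zeta(2+\alpha+\beta+\gamma+\delta)}.$$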

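(Perhaps “recall” is not the correct word here since the above is certainly a theorem but its
proof is not written down in full detail anywhere.)

Now

$$\zeta(s+\alpha)\zeta(s+\beta) = \sum_{n=1}^\infty \frac{\tau_{\alpha,\beta}(n)}{n^{s}}$$

where

$$\tau_{\alpha,\beta}(n) := \sum_{hk=n} h^{-\alpha}k^{-\beta}.$$

We let

$$D_{\alpha,\beta}(s) = \sum_{n=1}^\infty \frac{\tau_{\alpha,\beta}(n)}{n^{s}}$$

and

$$D_{\alpha,\beta}(s;X) = \sum_{n \le X} \frac{\tau_{\alpha,\beta}(n)}{n^{s}}$$

be its approximating Dirichlet polynomial of length $X$. Let

$$I(T;X) = I_{\alpha,\beta,\gamma,\delta}(T;X) = \int_0^\infty \psi\left(\frac{t}{T}\right) D_{\alpha,\beta}(s,X)\,D_{\gamma,\delta}(1-s,X)\,dt.$$

Our basic question is: How does $I(T;X)$ approach
$\int_0^\infty \psi(t/T)\,\zeta(s+\alpha)\zeta(s+\beta)\zeta(1-s+\gamma)\zeta(1-s+\delta)\,dt$ as $X \to \infty$?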


5. Descending through the recipe 

If we cheat, we can work backwards in a way and deduce the behavior of $I(T;X)$ from
that of $\int_0^\infty \psi(t/T)\,\zeta(s+\alpha)\zeta(s+\beta)\zeta(1-s+\gamma)\zeta(1-s+\delta)\,dt$.
The starting point is that by Perron’s formula

$$D_{\alpha,\beta}(s;X) = \frac{1}{2\pi i}\int_{(2)} D_{\alpha,\beta}(s+z)\,X^{z}\,\frac{dz}{z}.$$

So, then

$$I(T;X) = \frac{1}{(2\pi i)^2}\iint_{z,w} X^{z+w}\int_0^\infty \psi\left(\frac{t}{T}\right) D_{\alpha,\beta}(s+z)\,D_{\gamma,\delta}(1-s+w)\,dt\,\frac{dz}{z}\,\frac{dw}{w}.$$

Now, if we move the paths of integration in $z$ and $w$ so that their real parts are positive but
small, then we can use the formula (5) to evaluate the integral over $t$; however the imaginary
parts of $z$ and $w$ can be large. We believe (5) holds uniformly when the real parts of the
shifts are $\ll 1/\log T$ but the imaginary parts of the shifts can be large, as large as and
even larger. This has not been proven to hold for (5), but has been proven, as mentioned
above, by Sandro Bettin [B] in the case of the second moment.

Now we use the fact that

$$D_{\alpha,\beta}(s+z) = D_{\alpha+z,\beta+z}(s)$$

and, assuming the uniform version of (5), replace
$\int_0^\infty \psi(t/T)\,D_{\alpha,\beta}(s+z)\,D_{\gamma,\delta}(1-s+w)\,dt$ in the above by

$$T\int_0^\infty \psi(t)\,Z_{tT}(\alpha+z,\beta+z,\gamma+w,\delta+w)\,dt.$$

Next, note that

$$Z_t(\alpha+z,\beta+z,\gamma+w,\delta+w) = Z_t(\alpha+z+w,\beta+z+w,\gamma,\delta)$$

follows just from the symmetries of $Z_t$. We now change variables with $s = z+w$ and have

$$I(T;X) = \frac{1}{(2\pi i)^2}\int_{\Re s=4}\int_{\Re w=2} \frac{X^{s}}{w(s-w)}\,T\int_0^\infty \psi(t)\,Z_{tT}(\alpha+s,\beta+s;\gamma,\delta)\,dt\,dw\,ds + O(T^{2/3+\epsilon}).$$

We move the path of integration in the $w$-variable off to the left, to $-\infty$, and collect the
residue from the pole at $w = 0$. In this way we have

$$I(T;X) = \frac{1}{2\pi i}\int_{\Re s=4} \frac{X^{s}}{s}\,T\int_0^\infty \psi(t)\,Z_{tT}(\alpha+s,\beta+s;\gamma,\delta)\,dt\,ds + O(T^{2/3+\epsilon}).$$

6. How many swaps to use?

In the formula (6) we sort the terms according to the number $\ell$ of “swaps”; this is the
number of pairs of variables that are exchanged in the arguments of $Z_t$. Specifically, there
are 6 terms on the right side of (6): the first has 0 swaps, the next four each have one swap,
and the sixth and last term has two swaps. This notion is relevant because in the expressions
for

$$Z_{tT}(\alpha+s,\beta+s;\gamma,\delta)$$

the terms with $\ell$ swaps have factors of the form $\left(\frac{tT}{2\pi}\right)^{-\ell s}$. When
paired with the factor $X^{s}$ we see that to evaluate the relevant integral over $s$ we should
move the path to the right (and get 0 for an answer) whenever
$X < \left(\frac{tT}{2\pi}\right)^{\ell}$; otherwise we should move the path to the
left and evaluate the integral as the sum of the residues over the poles.
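The bookkeeping in this section can be sketched in a few lines of Python. This is only an illustration: the names `swap_terms` and `contributes` are invented here, the shifts are represented by plain strings, and the threshold test simply encodes the rule that an $\ell$-swap term survives when $X > (tT/2\pi)^{\ell}$.

```python
import math
from itertools import combinations

FIRST = ("alpha", "beta")     # shifts attached to zeta(s + .)
SECOND = ("gamma", "delta")   # shifts attached to zeta(1 - s + .)


def swap_terms():
    """List the terms of Z_t as tuples (ell, A, B).

    A is a subset of FIRST and B a subset of SECOND with |A| = |B| = ell;
    the corresponding term exchanges A with B (with a change of sign) in the
    arguments of Z and carries a power of t / (2 pi) whose exponent is minus
    the sum of the shifts in A and B.
    """
    terms = []
    for ell in range(len(FIRST) + 1):
        for A in combinations(FIRST, ell):
            for B in combinations(SECOND, ell):
                terms.append((ell, A, B))
    return terms


def contributes(ell: int, X: float, t: float, T: float) -> bool:
    """Heuristic from the text: an ell-swap term survives only if
    X > (t T / 2 pi)^ell; otherwise the s-path is moved right and gives 0."""
    return X > (t * T / (2.0 * math.pi)) ** ell


if __name__ == "__main__":
    for ell, A, B in swap_terms():
        print(ell, "swap(s): exchange", A, "with", B)
```

With $T = 1000$, $t = 1.5$ and $X = T^{3/2}$, for instance, the 0-swap and one-swap terms contribute while the two-swap term does not.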


7. Explicit formula: 0-swaps 

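We refer to the terms with 0-swaps as the “diagonal” terms. These are just

$$\frac{\hat\psi(0)\,T}{2\pi i}\int_{(2)} X^{s}\,Z(\alpha+s,\beta+s;\gamma,\delta)\,\frac{ds}{s}$$

which lead to $T\hat\psi(0)$ times

$$\begin{aligned}
\frac{1}{2\pi i}\int_{(2)} & \frac{\zeta(s+1+\alpha+\gamma)\zeta(s+1+\alpha+\delta)\zeta(s+1+\beta+\gamma)\zeta(s+1+\beta+\delta)}{\zeta(2s+2+\alpha+\beta+\gamma+\delta)}\,X^{s}\,\frac{ds}{s} \\
= {} & \frac{\zeta(1+\alpha+\gamma)\zeta(1+\alpha+\delta)\zeta(1+\beta+\gamma)\zeta(1+\beta+\delta)}{\zeta(2+\alpha+\beta+\gamma+\delta)} \\
& - \frac{X^{-\alpha-\gamma}}{\alpha+\gamma}\,\frac{\zeta(1+\delta-\gamma)\zeta(1+\beta-\alpha)\zeta(1+\beta-\alpha+\delta-\gamma)}{\zeta(2+\beta-\alpha+\delta-\gamma)} \\
& - \frac{X^{-\alpha-\delta}}{\alpha+\delta}\,\frac{\zeta(1+\gamma-\delta)\zeta(1+\beta-\alpha)\zeta(1+\beta-\alpha+\gamma-\delta)}{\zeta(2+\beta-\alpha+\gamma-\delta)} \\
& - \frac{X^{-\beta-\gamma}}{\beta+\gamma}\,\frac{\zeta(1+\alpha-\beta)\zeta(1+\delta-\gamma)\zeta(1+\alpha-\beta+\delta-\gamma)}{\zeta(2+\alpha-\beta+\delta-\gamma)} \\
& - \frac{X^{-\beta-\delta}}{\beta+\delta}\,\frac{\zeta(1+\alpha-\beta)\zeta(1+\gamma-\delta)\zeta(1+\alpha-\beta+\gamma-\delta)}{\zeta(2+\alpha-\beta+\gamma-\delta)}
\end{aligned}$$

plus an error term.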


8. Explicit formula: one swap 


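There are four terms with one swap. One of these is

$$T\int_0^\infty \psi(t)\,\frac{1}{2\pi i}\int_{(2)} \frac{X^{s}}{s}\left(\frac{tT}{2\pi}\right)^{-\alpha-\gamma-s} Z(-\gamma,\beta+s,-\alpha-s,\delta)\,ds\,dt.$$

If $X < \frac{tT}{2\pi}$ this is 0. If $X > \frac{tT}{2\pi}$ then we evaluate it as a sum over the
residues at all of the poles (near $s = 0$) of the integrand. Note that

$$Z(-\gamma,\beta+s,-\alpha-s,\delta) = \frac{\zeta(1-\alpha-\gamma-s)\,\zeta(1-\gamma+\delta)\,\zeta(1-\alpha+\beta)\,\zeta(1+\beta+\delta+s)}{\zeta(2-\alpha+\beta-\gamma+\delta)}$$

which has poles at $s = -\alpha-\gamma$ and $s = -\beta-\delta$. The sum of the residues at these
poles and at $s = 0$ gives

$$\begin{aligned}
T\int_0^\infty \psi(t)\Bigg( & \left(\frac{tT}{2\pi}\right)^{-\alpha-\gamma}\frac{\zeta(1-\alpha-\gamma)\zeta(1-\gamma+\delta)\zeta(1-\alpha+\beta)\zeta(1+\beta+\delta)}{\zeta(2-\alpha+\beta-\gamma+\delta)} \\
& + \frac{X^{-\alpha-\gamma}}{\alpha+\gamma}\,\frac{\zeta(1-\gamma+\delta)\zeta(1-\alpha+\beta)\zeta(1-\alpha+\beta-\gamma+\delta)}{\zeta(2-\alpha+\beta-\gamma+\delta)} \\
& - \frac{X^{-\beta-\delta}}{\beta+\delta}\left(\frac{tT}{2\pi}\right)^{\beta+\delta-\alpha-\gamma}\frac{\zeta(1-\alpha+\beta-\gamma+\delta)\zeta(1-\gamma+\delta)\zeta(1-\alpha+\beta)}{\zeta(2-\alpha+\beta-\gamma+\delta)}\Bigg)\,dt.
\end{aligned} \tag{7}$$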

Note that the term in the integrand which is independent of $t$ cancels with a term from the
0-swap diagonal piece above.


9. Explicit formula: two swaps


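For the term with two swaps we have

$$T\int_0^\infty \psi(t)\,\frac{1}{2\pi i}\int_{(2)} X^{s}\left(\frac{tT}{2\pi}\right)^{-\alpha-\beta-\gamma-\delta-2s} Z(-\gamma,-\delta,-\alpha-s,-\beta-s)\,\frac{ds}{s}\,dt.$$

If $t^2T^2 > 4\pi^2 X$ then the answer will be 0 as can be seen by moving the path of integration to
the right. If $t^2T^2 < 4\pi^2 X$ then we use the fact that

$$Z(-\gamma,-\delta,-\alpha-s,-\beta-s) = \frac{\zeta(1-\alpha-\gamma-s)\zeta(1-\beta-\gamma-s)\zeta(1-\alpha-\delta-s)\zeta(1-\beta-\delta-s)}{\zeta(2-\alpha-\beta-\gamma-\delta-2s)}$$

has poles at $s = -\alpha-\gamma$, $-\alpha-\delta$, $-\beta-\gamma$ and $-\beta-\delta$; so together with
the residue from the pole at $s = 0$ we will get a total of five terms for this integral:

$$\begin{aligned}
& \left(\frac{tT}{2\pi}\right)^{-\alpha-\beta-\gamma-\delta}\frac{\zeta(1-\alpha-\gamma)\zeta(1-\beta-\gamma)\zeta(1-\alpha-\delta)\zeta(1-\beta-\delta)}{\zeta(2-\alpha-\beta-\gamma-\delta)} \\
& + \frac{X^{-\alpha-\gamma}}{\alpha+\gamma}\left(\frac{tT}{2\pi}\right)^{\alpha-\beta+\gamma-\delta}\frac{\zeta(1+\alpha-\beta)\zeta(1+\gamma-\delta)\zeta(1+\alpha-\beta+\gamma-\delta)}{\zeta(2+\alpha-\beta+\gamma-\delta)} \\
& + \frac{X^{-\alpha-\delta}}{\alpha+\delta}\left(\frac{tT}{2\pi}\right)^{\alpha-\beta-\gamma+\delta}\frac{\zeta(1-\gamma+\delta)\zeta(1+\alpha-\beta-\gamma+\delta)\zeta(1+\alpha-\beta)}{\zeta(2+\alpha-\beta-\gamma+\delta)} \\
& + \frac{X^{-\beta-\gamma}}{\beta+\gamma}\left(\frac{tT}{2\pi}\right)^{-\alpha+\beta+\gamma-\delta}\frac{\zeta(1-\alpha+\beta)\zeta(1-\alpha+\beta+\gamma-\delta)\zeta(1+\gamma-\delta)}{\zeta(2-\alpha+\beta+\gamma-\delta)} \\
& + \frac{X^{-\beta-\delta}}{\beta+\delta}\left(\frac{tT}{2\pi}\right)^{-\alpha+\beta-\gamma+\delta}\frac{\zeta(1-\alpha+\beta-\gamma+\delta)\zeta(1-\gamma+\delta)\zeta(1-\alpha+\beta)}{\zeta(2-\alpha+\beta-\gamma+\delta)}.
\end{aligned}$$

Notice that the last term here is the negative of the last term in the explicit formula for
one swap. In the case that $X > T^2$ we add the 0-swap, one-swap, and two-swap terms
all together and all of the dependency on $X$ disappears; we recover exactly the formula (5).
This is to be expected since when $X > T^2$ it is known that the Dirichlet polynomial
$D_{\alpha,\beta}(s;X)$ is a good point-wise approximation to $\zeta(s+\alpha)\zeta(s+\beta)$, so the
mean squares are the same.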


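10. Ascending through convolution sums: the diagonal


Next we turn to what can be said about the mean-square $I(T;X)$ but now from the point
of view of coefficient correlations. We call this the “ascending” perspective. We consider

$$I_\psi(T,X) = \int_0^\infty D_{\alpha,\beta}(s,X)\,D_{\gamma,\delta}(1-s,X)\,\psi(t/T)\,dt = T\sum_{m,n \le X} \frac{\tau_{\alpha,\beta}(m)\,\tau_{\gamma,\delta}(n)}{\sqrt{mn}}\,\hat\psi\left(\frac{T}{2\pi}\log(m/n)\right).$$

The simplest terms are the diagonal terms:

$$\hat\psi(0)\,T\sum_{n \le X} \frac{\tau_{\alpha,\beta}(n)\,\tau_{\gamma,\delta}(n)}{n}.$$

The sum here may be evaluated by Perron’s formula as

$$\sum_{n \le X} \frac{\tau_{\alpha,\beta}(n)\,\tau_{\gamma,\delta}(n)}{n} = \frac{1}{2\pi i}\int_{(2)} \sum_{n=1}^\infty \frac{\tau_{\alpha,\beta}(n)\,\tau_{\gamma,\delta}(n)}{n^{1+s}}\,X^{s}\,\frac{ds}{s}.$$

But

$$\sum_{n=1}^\infty \frac{\tau_{\alpha,\beta}(n)\,\tau_{\gamma,\delta}(n)}{n^{1+s}} = Z(\alpha+s,\beta+s;\gamma,\delta)$$

and so the diagonal terms are identical to the 0-swap terms already considered above.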
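As a numerical sanity check of this Dirichlet series identity, the following sketch compares a truncation of the series with the zeta quotient $Z(\alpha+s,\beta+s;\gamma,\delta)$ at real arguments. The function names, the truncation point `N`, the sample shifts and the Euler-Maclaurin approximation used for $\zeta$ are all choices made for this illustration, not part of the paper.

```python
import math


def tau_table(N, a, b):
    """tau[n] = sum over factorizations h*k = n of h^(-a) * k^(-b), for n <= N."""
    tau = [0.0] * (N + 1)
    for h in range(1, N + 1):
        ha = h ** (-a)
        for k in range(1, N // h + 1):
            tau[h * k] += ha * k ** (-b)
    return tau


def zeta(x, M=1000):
    """Riemann zeta for real x > 1: partial sum plus an Euler-Maclaurin tail."""
    head = math.fsum(n ** (-x) for n in range(1, M))
    tail = M ** (1.0 - x) / (x - 1.0) + 0.5 * M ** (-x) + x * M ** (-x - 1.0) / 12.0
    return head + tail


def Z(a, b, c, d):
    """Z(a,b,c,d) = zeta(1+a+c) zeta(1+a+d) zeta(1+b+c) zeta(1+b+d) / zeta(2+a+b+c+d)."""
    num = zeta(1 + a + c) * zeta(1 + a + d) * zeta(1 + b + c) * zeta(1 + b + d)
    return num / zeta(2 + a + b + c + d)


def diagonal_series(a, b, c, d, s, N=3000):
    """Truncation at n <= N of sum tau_{a,b}(n) tau_{c,d}(n) / n^(1+s)."""
    t1, t2 = tau_table(N, a, b), tau_table(N, c, d)
    return math.fsum(t1[n] * t2[n] / n ** (1.0 + s) for n in range(1, N + 1))


if __name__ == "__main__":
    a, b, c, d, s = 0.1, 0.2, 0.05, 0.15, 2.0
    # The two printed numbers should agree up to the truncation error.
    print(diagonal_series(a, b, c, d, s), Z(a + s, b + s, c, d))
```

The check is run at a real point well inside the region of absolute convergence; it says nothing about the analytic continuation used when the $s$-path is moved.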

11. Ascending: Type I off-diagonals with shifts 
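Now we are left with the terms

$$T\sum_{\substack{m,n \le X \\ m \ne n}} \frac{\tau_{\alpha,\beta}(m)\,\tau_{\gamma,\delta}(n)}{\sqrt{mn}}\,\hat\psi\left(\frac{T}{2\pi}\log(m/n)\right).$$

We let $m = n + h$ and obtain (just as in [GG])

$$2T\,\Re\int_y^\infty \sum_{h \le v/y} M'\left(\frac{hT}{2\pi v}, h\right)\hat\psi(v)\,\frac{dv}{v}$$

where the coefficient correlation is

$$\sum_{n \le u} \tau_{\alpha,\beta}(n)\,\tau_{\gamma,\delta}(n+h) = M(u,h) + E(u,h).$$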


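We use the $\delta$-method of [DFI] to get an expression for $M'(u,h)$, the average value of
$\sum_{n \le u} \tau_{\alpha,\beta}(n)\,\tau_{\gamma,\delta}(n+h)$. Namely,

$$M'(u,h) = \sum_{q=1}^\infty c_q(h)\,P_{\alpha,\beta}(u,q)\,P_{\gamma,\delta}(u+h,q)$$

where $c_q(h) = \sum_{d \mid h,\ d \mid q} d\,\mu(q/d)$ is the Ramanujan sum and where
$P_{\alpha,\beta}(u,q)$ is the average value of $\sum_{n \le u} \tau_{\alpha,\beta}(n)\,e(n/q)$. The latter
is just the sum of the residues near 1 of $D_{\alpha,\beta}(s,1/q)\,u^{s-1}$ where

$$D_{\alpha,\beta}(s,1/q) = \sum_{n=1}^\infty \tau_{\alpha,\beta}(n)\,e(n/q)\,n^{-s}.$$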
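For concreteness, here is a small sketch that computes the Ramanujan sum from the divisor formula and compares it with its definition as an exponential sum over residues coprime to $q$. The helper names are hypothetical, and the Moebius function is computed by naive trial division, which is adequate only for small arguments.

```python
import cmath
import math


def mobius(n: int) -> int:
    """Moebius function mu(n) by trial division."""
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # the square of p divides the original n
            result = -result
        p += 1
    if n > 1:
        result = -result
    return result


def ramanujan_sum(q: int, h: int) -> int:
    """c_q(h) = sum over d dividing both h and q of d * mu(q / d)."""
    g = math.gcd(q, h)
    return sum(d * mobius(q // d) for d in range(1, g + 1) if g % d == 0)


def ramanujan_sum_exponential(q: int, h: int) -> complex:
    """c_q(h) as the sum of e(a h / q) over 1 <= a <= q with gcd(a, q) = 1."""
    return sum(cmath.exp(2j * cmath.pi * a * h / q)
               for a in range(1, q + 1) if math.gcd(a, q) == 1)


if __name__ == "__main__":
    for q in range(1, 13):
        print(q, [ramanujan_sum(q, h) for h in range(1, 7)])
```

The two functions agree up to floating-point rounding, which is the content of the divisor formula for $c_q(h)$.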


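In [G] Lemma 4, for example, it is shown that
$D_{\alpha,\beta}(s,1/q) - q^{1-\alpha-\beta-2s}D_{\alpha,\beta}(s)$ is entire. Thus,

$$P_{\alpha,\beta}(u,q) = \frac{1}{q}\,\zeta(1-\alpha+\beta)\,u^{-\alpha}q^{\alpha-\beta} + \frac{1}{q}\,\zeta(1+\alpha-\beta)\,u^{-\beta}q^{\beta-\alpha}$$

and

$$M'(u,h) = \sum_{q=1}^\infty \frac{c_q(h)}{q^2}\left(\zeta(1-\alpha+\beta)\,u^{-\alpha}q^{\alpha-\beta} + \zeta(1+\alpha-\beta)\,u^{-\beta}q^{\beta-\alpha}\right) \times \left(\zeta(1-\gamma+\delta)\,u^{-\gamma}q^{\gamma-\delta} + \zeta(1+\gamma-\delta)\,u^{-\delta}q^{\delta-\gamma}\right).$$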

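We multiply out this expression and have 4 terms each of which can be expressed in terms
of

$$M_{\alpha,\beta;\gamma,\delta}(h) := \frac{\zeta(1+\alpha+\beta)\,\zeta(1+\gamma+\delta)}{\zeta(2+\alpha+\beta+\gamma+\delta)}\sum_{d \mid h} \frac{1}{d^{1+\alpha+\beta+\gamma+\delta}}.$$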
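The reduction of each of the four terms rests on the classical evaluation of the Dirichlet series of Ramanujan sums. Here is a sketch of the step for the first term, assuming $\Re w > 1$ so that everything converges absolutely (the symbol $w$ is introduced here only for illustration):

```latex
\begin{align*}
\sum_{q=1}^{\infty}\frac{c_q(h)}{q^{w}}
  &= \sum_{d\mid h} d \sum_{m=1}^{\infty}\frac{\mu(m)}{(dm)^{w}}
   = \frac{1}{\zeta(w)}\sum_{d\mid h} d^{1-w}, \\
\intertext{so with $w = 2-\alpha+\beta-\gamma+\delta$,}
\zeta(1-\alpha+\beta)\zeta(1-\gamma+\delta)\,u^{-\alpha-\gamma}
  \sum_{q=1}^{\infty}\frac{c_q(h)}{q^{2-\alpha+\beta-\gamma+\delta}}
  &= u^{-\alpha-\gamma}\,
     \frac{\zeta(1-\alpha+\beta)\zeta(1-\gamma+\delta)}{\zeta(2-\alpha+\beta-\gamma+\delta)}
     \sum_{d\mid h}\frac{1}{d^{1-\alpha+\beta-\gamma+\delta}}.
\end{align*}
```

The other three terms follow in the same way after changing the signs of the relevant shifts.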


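Specifically, we have

$$M'(u,h) = u^{-\alpha-\gamma}M_{-\alpha,\beta;-\gamma,\delta}(h) + u^{-\alpha-\delta}M_{-\alpha,\beta;\gamma,-\delta}(h) + u^{-\beta-\gamma}M_{\alpha,-\beta;-\gamma,\delta}(h) + u^{-\beta-\delta}M_{\alpha,-\beta;\gamma,-\delta}(h).$$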

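Now, much as we did in the first section we compute

$$2\Re\int_0^\infty \sum_{h \le v/y} M'\left(\frac{hT}{2\pi v}, h\right)\hat\psi(v)\,\frac{dv}{v}.$$

(Note that the integrand is 0 for $0 < v < y$.) We have that one term (the term corresponding
to $u^{-\alpha-\gamma}$) of $\sum_{h \le v/y} M'\left(\frac{hT}{2\pi v}, h\right)$ is

$$\begin{aligned}
& \frac{\zeta(1-\alpha+\beta)\,\zeta(1-\gamma+\delta)}{\zeta(2-\alpha+\beta-\gamma+\delta)}\left(\frac{T}{2\pi v}\right)^{-\alpha-\gamma}\sum_{h \le v/y} h^{-\alpha-\gamma}\sum_{d \mid h} d^{-1+\alpha-\beta+\gamma-\delta} \\
& \quad = \frac{\zeta(1-\alpha+\beta)\,\zeta(1-\gamma+\delta)}{\zeta(2-\alpha+\beta-\gamma+\delta)}\left(\frac{T}{2\pi v}\right)^{-\alpha-\gamma}\frac{1}{2\pi i}\int_{(2)} \zeta(s+\alpha+\gamma)\,\zeta(s+1+\beta+\delta)\left(\frac{v}{y}\right)^{s}\frac{ds}{s}.
\end{aligned}$$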

As before, see (3), we compute that

$$\zeta(s+\alpha+\gamma)\,2\int_0^\infty \Re\hat\psi(v)\,v^{s+\alpha+\gamma}\,\frac{dv}{v} = \zeta(1-s-\alpha-\gamma)\int_0^\infty \psi(u)\,u^{-s-\alpha-\gamma}\,du.$$

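Then, by moving the $s$-path of integration to $\Re s = -1/2$ we have

$$\begin{aligned}
\frac{1}{2\pi i}\int_{(2)} & \zeta(1-s-\alpha-\gamma)\,\zeta(s+1+\beta+\delta)\,\frac{u^{-s-\alpha-\gamma}\,y^{-s}}{s}\,ds \\
= {} & u^{-\alpha-\gamma}\,\zeta(1-\alpha-\gamma)\,\zeta(1+\beta+\delta) + \zeta(1-\alpha+\beta-\gamma+\delta)\,\frac{y^{\alpha+\gamma}}{\alpha+\gamma} \\
& - \zeta(1-\alpha+\beta-\gamma+\delta)\,\frac{u^{\beta+\delta-\alpha-\gamma}\,y^{\beta+\delta}}{\beta+\delta} + O(y^{1/2}).
\end{aligned}$$

Now we have shown that the contribution of the $u^{-\alpha-\gamma}$ term to
$2\Re\int_0^\infty \sum_{h \le v/y} M'\left(\frac{hT}{2\pi v}, h\right)\hat\psi(v)\,\frac{dv}{v}$ is

$$\begin{aligned}
\left(\frac{T}{2\pi}\right)^{-\alpha-\gamma} & \frac{\zeta(1-\alpha+\beta)\,\zeta(1-\gamma+\delta)}{\zeta(2-\alpha+\beta-\gamma+\delta)} \\
\times \int_0^\infty \psi(u)\Bigg( & u^{-\alpha-\gamma}\,\zeta(1-\alpha-\gamma)\,\zeta(1+\beta+\delta) + \zeta(1-\alpha+\beta-\gamma+\delta)\,\frac{y^{\alpha+\gamma}}{\alpha+\gamma} \\
& - \zeta(1-\alpha+\beta-\gamma+\delta)\,\frac{u^{\beta+\delta-\alpha-\gamma}\,y^{\beta+\delta}}{\beta+\delta}\Bigg)\,du + O(y^{1/2}).
\end{aligned}$$

Recalling that $y = T/(2\pi X)$ we see that this term matches up with the first one-swap term (7)
that we got from the recipe. Similarly the other three terms match up. It follows
that the one-swap terms match up with the Type I off-diagonal terms.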


12. Conclusion 


The coefficient correlation terms that we have analyzed here contribute to $I(T;X)$ once
$X$ gets larger than $T$. We have shown two very different ways to view these terms: from the
1-swap terms of the recipe conjecture, and from the Type-I divisor correlation sums much
studied in analytic number theory. Our results illustrate that there are other terms to be
considered once $X > T^2$, i.e. that there is something missing from the above analysis. These
are the divisor correlations that must match the 2-swapped terms.

The next step is to describe, from the ascending perspective, the terms that we see in the
descent from (5) to the terms with 2-swaps. This will be the subject of a subsequent paper
where the coefficient sums to be considered are much as in [BK]:

$$\sum_{M_1,M_2,h_1,h_2}\ \sum_{\substack{m_1,m_2,n_1,n_2 \\ (*_1),\,(*_2) \\ m_1m_2 \le X}} \frac{m_1^{-\alpha}\,n_1^{-\gamma}\,m_2^{-\beta}\,n_2^{-\delta}}{m_1m_2}\,\hat\psi\left(\frac{T}{2\pi}\left(\frac{h_1}{m_1M_1} + \frac{h_2}{m_2M_2}\right)\right)$$

where

$$(*_1):\quad M_1m_1 = M_2n_1 + h_1$$

$$(*_2):\quad M_2m_2 = M_1n_2 + h_2$$

for coprime integers $M_1$ and $M_2$.